Geely Auto has recently achieved a breakthrough in speech synthesis: its self-developed HAM-TTS large model outperforms the industry benchmark VALL-E, a result that has drawn widespread attention. The AI large model, named "Xingrui," shows marked improvements in key indicators such as pronunciation accuracy, naturalness, and speaker similarity. HAM-TTS applies hierarchical acoustic modeling to token-based zero-shot text-to-speech, greatly enhancing the user interaction experience in intelligent cockpits. At the same scale of 400 million parameters, HAM-TTS...